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(57) Abstract 

In an embedded DCT-based (EDCT) image coding method, 
decoded images which give better PSNR over earlier JPEG and 
DCT-based coders are obtained by a scanning order starting, for 
each bitplane, from the upper left corner of a DCT block (corre- 
sponding to the DC coefficient) and transmitting the coefficients 
in an order of importance. An embedded bit-stream is produced 
by the encoder. The" decoder can cut the bit-stream at any point 
and therefore reconstruct an image at a lower bitrate. The quality 
of the reconstructed image at this lower rate is the same as if 
the image was coded directly at that rate. Near lossless recon- 
struction of the image is possible, up to the accuracy of the DCT 
coefficients. The algorithm is very useful in various applications, 
like WWW, fast browsing of databases, medical imaging, etc. 
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AN EMBEDDED DCT-BASED STILL IMAGE CODING ALGORITHM 

TECHNICAL FIELD 

The present invention relates to a coding algorithm for still images and to a device for 
executing the algorithm. The algorithm is particularly well suited for generating an 
embedded bit-stream and for coding parts of the image with different quality than the 
rest of the image. 

BACKGROUND OF THE INVENTION AND PRIOR ART 

Transform coding has been widely used in many practical image/video compression 
systems. The basic idea behind using a transformation is to make the task of 
compressing the image after transformation easier than direct coding in the spatial 
domain. The Discrete Cosine Transform (DCT) has been used as the transformation in 
most of the coding standards as JPEG, H261/H.263 and MPEG. 

In recent years most of the research activities have shifted from the DCT to the wavelet 
transform, especially after Shapiro published his work on embedded zerotree wavelet 
(EZW) image coding, see J. M. Shapiro, "Embedded Image Coding using zerotrees of 
wavelet coefficients", IEEE Trans, on Signal Processing, Vol. 41, No. 12, pp. 
3445-3462, Dec. 1993. 

The paper, W. B. Pennebaker, J. L. Mitchell, JPEG Still Image Data Compression 
Standard, Van Nostrand Reinhold, New York, 1993 describe the state of the art in 
DCT-based coding. 

In many applications it is desired to obtain an embedded bit-stream. Since an 
embedded bit-stream contains all lower rates embedded at the beginning of the bit- 
stream, the bits are ordered from the most important to the less important. Using an 
embedded code, the encoding simply stops when the target parameter as the bit count 
is met. In a similar manner, given the embedded bit-stream, the decoder can cease 
decoding at any point and can produce reconstructions corresponding to all lower-rate 
encoding. 

In order to make the embedded bit-stream optimal it is desired to transmit the bits, 
which are most significant for the visual perception of an image first. This corresponds 
to letting the bit-stream have a good compression/quality ratio at low bit rates. 

The DCT is orthonormal, which means that it preserves the energy. In other words, with 
respect to the root mean squared error (RMSE) (or peak signal-to-noise ratio - PSNR) 
an error in the transformed image of a certain magnitude will produce an error of the 
same magnitude in the original image. 

This means that the coefficients with the largest magnitudes should be transmitted first 
because they have the largest content of information. This also means that the 
information can also be ranked according to its binary representation, and the most 
significant bits should be transmitted first. 

After the DCT transformation, most of the energy of the image is concentrated in low 
frequency coefficients, and the rest of the coefficients have very low values. This 
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means that there are very many zeroes in the most significant bit planes (MSB) of the 
coefficients. Until the first significant bit (FSB) of a certain coefficient is found the 
probability of zero is very high. The task of efficient encoding therefore becomes the 
task of encoding these zeroes in an efficient way. 

In the Papers Z. Xiong, O. Guleryuz, M.T. Orchard, A DCT-based embedded image 
coder, IEEE Signal Processing Letters, Vol. 3, No. 11, pp. 289-290, Nov 1996 N K 
r^™ 6 ' DM Monro ' Embe dded DCT coding with significance masking" Proc IEEE 
ICASSP 97 , Vol. IV, pp. 2717-2720, 1997 and J. Li, J. Li, C.-C. Jay Kuo, Layered DCT 
still image compression, IEEE Trans. On Circuits and Systems for Video Technology 
Vol 7, No. 2, April 1997, pp. 440-442, although DCT is the transform used, the coding 
of the coefficients is not done in the way that JPEG coding is done. Instead an 
embedded bit stream can be produced. 

SUMMARY 

It is an object of the present invention to provide an algorithm and a device for 
executing the algorithm, which output an embedded DCT-based (EDCT) image from 
which the decoded images give better PSNR over those from JPEG and the'DCT- 
based coders published in the above cited papers. 

It is also an object of the invention to provide methods that can be used for encodinq 
Regions of Interest (ROIs) of the image with different quality than the rest of the imaqe 
while still keeping the embedded property of the algorithm. 

These objects are obtained by a method employing the following basic steps: 

- Partition the image into rectangular blocks. 

- Transform each block separately with the DCT. The transformation produces a block 
of coefficients with the DC coefficient in the upper left corner and higher horizontal and 
vertical frequency coefficients in the direction of the corresponding block axis 
Traditionally the blocks used are 8x8 in size, but here any power of two block size is 
allowed. The reason to restrict the size to a power of two is that then the DCT can be 
computed fast. 

- Quantize and transmit the DCT coefficients in a progressive manner, so that the most 
important information is transmitted first. 

However, in order to make the compression good at every transmission rate, i e so that 
he bit-stream can be cut at any time and still provide an image having a good quality at 
that compression ratio, an effective scan order needs to be used. 

Thus according to the invention, for each coefficient, the first non-zero bit starting 
from the most significant to less significant bits, is called the First significant bit (FSB) 
J-?n ? Tu° f 3 coefflc,ent P rior to tne firs t significant bit will be referred to as the Zero bits 
( 7 \J ne sign information is represented by the Sign bit (SB), while the rest of the bits 
after the first significant bit are called Raw bits (RBs). Coding is done bitplane by 
bitplane In each bitplane, the coding is from the lowest frequency coefficient to the 
highest frequency. 
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For every zero bit sent, the length of the uncertainty interval for that coefficient seen by 
the decoder is divided by two. When the first significant bit is encountered, the sign of 
the coefficient needs to be sent to maintain an embedded code, as will be described 
below. 

After the sign bit, the RBs have to be transmitted. These contain very little redundancy 
and there is very little to be gained by trying to encode these with a good prediction. 

The receiver can now reverse these steps. The produced bit-stream is embedded and 
the decoder can therefore cut the bit-stream at any point and generate an image having 
the same quality as if it was compressed directly at that bitrate. 

By using this scan order for such a coding scheme a very good visual quality is 
obtained for all compression ratios. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will now be described in more detail and with reference to the 
accompanying drawings, in which: 

- Fig. 1 illustrates how the DCT coefficients are arranged 

- Fig. 2 illustrates the bit order in a pixel 

- Fig. 3 illustrates the scan order for an embedded coding 
algorithm 

- Fig. 4 illustrates the sending of a sign bit 

- Fig. 5 illustrates the context for AC coefficient zero bits 

- Fig. 6 illustrates the context for DC coefficient zero bits 

- Fig. 7 illustrates the context for cut_off signals 

- Fig. 8 is a flow chart illustrating the coding procedure 

DESCRIPTION OF PREFERRED EMBODIMENTS 

In Fig. 1 the arrangement of the DCT coefficients in an 8x8 DCT block is illustrated. 
Thus, in the upper left corner the low frequency coefficients are found and the higher 
frequency coefficients are found down and to the right. 

Each coefficient is represented by a number of bits, for example 8 as is illustrated in 
Fig. 2. In Fig. 2 the first three bits are zero, called zero bits (ZB) the fourth bit is not 
zero and is called first significant bit (FSB) and the bits having lower significance are 
called raw bits (RB). 

In a preferred embodiment the coding algorithm for a digitized DCT transformed image 
is as follows: 

(1) Find the mean value (DC_mean) of all DC coefficients. Subtract this value from 
each DC coefficient. 

(2) Choose a quantizer that is half the size of the largest magnitude coefficient in the 
image. Transmit this quantizer. 

(3) Send/encode the information of which new coefficients are significant with respect 
to the current quantizer and also the sign of these coefficients. A coefficient is said to 
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be significant with respect to a quantizer if its magnitude is larger than the current 
quantizer (in absolute terms) is. 

(4) Subtract the current quantizer from the magnitude of the coefficients found to be 
significant in this bit plane. Replace the significant coefficients magnitude by its 
magnitude subtracted by the current quantizer. The difference corresponds to keeping 
only the raw bits. 

(5) For ail coefficients that have been significant in previous bit planes, send/encode 
the information of whether the coefficients have a larger or smaller magnitude than the 
quantizer. Subtract the quantizer from the magnitude of the ones that do and replace 
those coefficients by the resulted value. This corresponds to transmitting a raw bit. 

(6) Divide the current quantizer by two. This corresponds to going down to a less 
significant bit plane of the coefficients. 

(7) Repeat from step (3) until the bit budget is exhausted or some desired quality is 
reached. 

Notice that step 1 above is optional. If it is used, the mean value of the DC coefficients 
has to be stored/transmitted. 

The reconstruction is done as follows: 

- Set all coefficients to zero. 

- Receive the first quantizer. 

- Receive the information about the new significant coefficients. 

- Reconstruct these as (1 .5 * current quantizer * the coefficient sign). This is because at 
this stage it is known that the coefficient's magnitude is between current quantizer and 
(2 * current quantizer). This puts (1.5 * current quantizer) in the middle of the 
uncertainty interval. The addition or subtraction performed at step 5 below will update 
the coefficients so that they are always in the middle of the uncertainty interval. 

- For all previously significant coefficients, check if the coefficients have a larger 
magnitude than the current quantizer. Add the current quantizer/2 to the magnitude of 
the ones that do and subtract the current quantizer/2 from the magnitude from the ones 
that do not. 

- Divide the current quantizer by two. 

- Repeat from step 3, i.e. reception of new significant coefficients, until the desired 
quality is reached or no more information exists. 

If step (1) had been performed at the encoder, then the decoder has also received the 
mean value of the DC coefficients and this value is added to the reconstructed DC 
coefficients. 
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Furthermore, for updating the coefficients in every bit plane a scan order needs to be 
defined. In a preferred embodiment one coefficient is updated in all blocks before 
proceeding to the next coefficient. 

Inside a block, the DCT coefficients are scanned in a diagonal order, bit plane by bit 
plane. After each scanned diagonal, a flag is sent telling if there are any new significant 
coefficients in the rest of the block. This will be referred to as the block cut_off. The 
block cut_off is used because in the first bit planes there are so many zeroes that in 
practice an explicit symbol performs better than trying to code all the zeroes with a 
good prediction. The block cut off symbols only concern the new significant 
coefficients. 

Notice that instead of having a cut_off flag for each diagonal, run length coding 
methods could be used. For example, if a coefficient is significant, a number could be 
used to denote the position of the next significant coefficient in the block. 

As explained above a sign bit has already been sent for the previously significant 
coefficients. The uncertainty interval is therefore double as big for the coefficients not 
yet significant and these should be considered first in the new scan. Therefore, during 
each bitplane, first the significance identification is encoded and then the refinement 
quantization (steps 3 and 5 of the encoding process). 

In fig. 4 the reason for sending a sign bit after encountering the first significant bit is 
illustrated! Thus, after having sent the sign bit the sign of the coefficient is determined 
and the resulting coefficient is no longer ambiguous. 

During each bit plane, the scanning of the coefficients is preferably done in the 
following manner: first all DC coefficients, then all AC coefficients with the same index, 
in a diagonal order starting from the upper left corner and ending in the lower right 
corner of a DCT block of the image, i.e. DC, AC1, AC2, etc... 

The zig-zag scanning used in JPEG could also be used without affecting the property 
of embedded coding of the algorithm. 

An embedded bit-stream is in this manner produced by the encoder. The decoder can 
cut the bit-stream at any point and therefore reconstruct an image at a lower bitrate. 
The quality of the reconstructed image at this lower rate would be the same as if the 
image was coder directly at that rate. Near lossless and lossless reconstruction of the 
image is possible. 

The scan order used is illustrated in fig 8. Thus, first in a block 801 the procedure 
starts. Next in a block 803 a DCT transformation is executed. Thereupon the first bit- 
plane of the DCT coefficients is extracted in a block 805. The contexts are then reset in 
a block 807. 

Next, the first diagonal is extracted in a block 809 and the first coefficient is extracted in 
a block 811. The first block is then extracted in a block 813. Then in a block 815 it is 
checked if the block is already cut. If this is the case the procedure proceeds to a block 
817 and else to a block 819. 
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In the block 819 it is checked if the block should be cut off at this movement If this is 
the case the procedure proceeds to a block 821 in which a block cut off symbol is 
transmitted, else the procedure proceeds to a block 823. 

In the block 823 it is checked if the coefficient is previously significant, i.e. it is checked 
if this is the first significant bit (FSB). If the coefficient is previously significant the 
procedure proceeds to the block 817 and else to a block 825. 

In the block 825 it is checked if the coefficient is significant now. If this is the case the 
procedure proceeds to a block 827 and else to a block 829. 

In the block 829 data is sent containing information that the coefficient is not significant 
and from the block 829 the procedure proceeds to the block 817. 

Ln^f™ 0 ?^ da ?o ' S SSnt containin 9 information that the coefficient is significant 
and from the block 829 the procedure proceeds to the block 817 via a block 831 in 
which the sign is sent. 

In the block 817 it is checked if this is the last block of the DCT image If this is the 

hf^l ft^ Pr °? e 6 proceeds to a block 833 and else the procedure returns to the 
block 81 5 and repeats the process for the next block. 

In the block 833 it is checked is the current coefficient is the last coefficient in the 

SowS ' S *u ° aSe thS P rocedure Proceeds to a block 835 and else it returns to 
ine Diock 813 for the next coefficient. 

In the block 835 it is checked is this diagonal is the last diagonal. If this is the case the 
diagonal 6 Pr ° CeedS t0 3 block 837 and else ]t retums to tne b '°ck 811 for the next 

LVr^^u 837 thS refinement start s- Thus, in the block 837 the first diagonal is 
I I m J I h6 ?J n 3 b ' OCk 839 the firSt coefficie nt ^ the first diagonal is extracted. Next 
n a block 841 the first block is contemplated. Thereupon in a block 843 it is checked if 
the current coefficient is previously significant. If this is not the case the procedure 
proceeds to a block 847 and else the procedure proceeds to the block 847 via a block 
«4b, in which the raw bits are sent. 

In the block 847 it is checked if the current block is the last block. If this is the case the 
P h r rn e ex^blo P ck CeedS t0 3 b '° Ck 849 6iSe th6 procedure returns to the bloc k 843 for 

In the block 849 it is checked if the current coefficient is the last in the current diagonal 

L,?rne'f wf S ? l he P rocedure Proceeds to a block 851 and else the procedure 
returns to the block 841 for the next coefficient. 

raIH e J? '° Ck * S \ U iS ° heCked if the CUrrent diagonal is the ,ast Agonal. If this is the 

^S?qS^ Ure f!? CeedS t0 3 b ' 0Ck 853 and else the P roced ^e returns to the 
diock bJ9 for the next diagonal. 
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In the block 853 it is checked if the current bit plane is the last bit plane. If this is the 
case the procedure proceeds to a block 855, in which the procedure is terminated, and 
else the procedure returns to the block 807 for the next bit plane. 

To further clarify the scan order and the algorithm in general, pseudo code for the 
encoder is presented below: 

Codejmage { 
Do forward DCT; 

for(all bitplanes) 
{ 

reset_contexts; 

/*significance identification pass*/ 
for(alLdiagonals) 

{ 

for(all_coefficients_in_diagonal) 
{ 

for(all_blocks_not_cut_off_al ready) 
{ 

if(cut_off_position) send cut_off_symbol; 
if(coefficient_not_previousiy__significant) 

{ 

if (significant) 
{ 

send(significant); 
send(sign); 

} 

else send(not significant); 

} 

} 

} 

} - 

^refinement pass*/ 
for(all_diagonals) 

{ 

for(all_coefficients_in_diagonal) 
{ 

for(all_blocks) 
{ 

if(coefficient stgnificant_in__any_previous_bitplane) 

send(raw_bit); 

} 

} 

} 

} 

} 
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noted that it is possible to send them as raw bits and not encode them with arithmetic 
coding. This might reduce the performance but will increase the execution speed of the 
algorithm. 

The DC sign bit is coded in a context chosen by adding together the number of DC 
neighbours that are marked in the significance plane and have a positive sign as is 
illustrated in Fig. 6. The AC coefficients zero bits (and significant bit) can be coded 
taking into account 6 neighbouring coefficients in the block and the same coefficient in 
three neighbouring blocks as is illustrated in Fig. 5. The information in the significance 
plane for these coefficients, is used for the context. 

For the DC coefficient zero bits the context is chosen using the DC coefficients in all 
the neighbouring blocks. Also in this case the only thing considered is the significance 
plane. 

The block cut_off is coded in the context of the cut_off symbols in 4 neighbouring 
blocks as is illustrated by Fig. 7. The diagonal number is also taken into account. This 
is done by using the four bits of the related cut off symbols and the diagonal, which is a 
4 bit number and compose them into an 8 bit integer that indexes the context. 

The results can be improved, for example by using some suitable post processing 
algorithm. 

The computational complexity and memory requirements of the algorithm described 
herein are mainly due to the DCT computation and the use of arithmetic coding. 

Since the algorithm is block-based, parallel implementation is possible. In particular, 
the parallel implementation of the transformation stage is possible. Assume that P 
processors are available, to process an NxM image which is divided in 8x8 blocks, then 
each processor can do the DCT transformation in the blocks of the (N/P)xM part of the 
image. Furthermore, bitplanes can be processed independently and parallel. This is 
due to the fact that the contexts are restarted in each bitplane and that no grouping of 
coefficients is performed. All that needs to be known in a specific bitplane is which 
coefficients are significant previous of that bitplane. This information can be derived 
(also in parallel) from the DCT coefficients immediately after the transformation. Notice 
that the sign bit needs to be coded in a simple fashion. 

Notice that the algorithm is very well suited for progressive image transmission. During 
progressive image transmission, it is desired to provide lossless compression at the 
last stage of the transmission. Without any restriction on the accuracy of DCT, near 
lossless reconstruction can be achieved. Lossless reconstruction can be achieved by a 
lossy plus lossless residual encoding method in which case the proposed algorithm is 
used for achieving the lossy part of the compression. Notice that this approach requires 
an integer inverse DCT to ensure portability of software and hardware. 

In certain applications, particular regions of an image need to be coded with better 
quality compared to the other parts. The algorithm can be used for coding Regions of 
Interest (ROI) with different quality compared to the rest of the image. For example, if a 
ROI is to be coded at 1 bit per pixel (bpp) and the rest of the image at 0.5 bpp, then 
during a progressive image transmission, the image could be transmitted until the 0.5 
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o^.H S K ad ? ieVed 3nd then inforrTi ation to reconstruct the ROI at an additional 0 5 boo 
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have arbitrary size. In this case the methods used in MPEG 4 can be appHed (Sock 

Quired to m a a k P e e it A w P i iVe t ? CT 2- S ° me m ° dif iC3ti0nS t0 the a, 9° rithm ^hS^S 
2 Tn ° w W w 0rk Wlth arb,trary R0I ' S Various schemes for progressive codino 

Tha^he RoTs'hao^ ^ be '° W , N ° tiC6 f ° r sim P |ificati °" P«Po~S it is assumed 
hat the ROI shape is a union of rectangles where the dimensions of the rectanqles are 
the ones used for DCT coding (i.e. 8x8 pixels in most applications using JPEG) 

bppTS D a ffir e e nt b th te f0f back 9 round is x ^Ixel (bpp) and for the ROI is y 
imaae where ^ hLl T US6d t0 3Chieve Passive transmission of the 

mage where the background is coded at x bpp and the ROI (or more ROi's^ at v, hnn 
(the bitrate for different ROI's can be different) > V bpP 

Below, three different schemes for obtaining such different bitrates are outlined. 

ieDaratelv S ^?h^* h l baCk9r T d (BG) 3nd the region of interest (ROI ) are coded 
sXT^ m ' 9ht T T? in S ° me Pr ° blemS f0r Passive transmission, 

diSLnt hSS are ° 0ded se P aratelv . there must be a way to pack the 2 

rr^AKSv 1 '^ 6 S ° tC \ aChi6Ve P r °9 ressive transmission oMhe whole 
TnH 9 L It J V ,'r! he P r °9 ressive transmission will result in BG or ROI send first 

JE^^n^E?' ° f BG) l3St TNS meanS that the ^ceiver c?nnor g et an 
idea of what the whole image is at an early stage of the transmission. 

Tin^T 0 ' 6 ' corr ^ ,a i ion between neighbouring blocks in BG and ROI is not exploited 
m2fh h V h are C ° d6d se P arate| y- Thi s will reduce the compression performance The 
method is however good for parallel processing. perrormance. me 

^m^ d H SCh n me th , B R0 ' coefficien ts are shifted or multiplied by a certain number 
ItJno Z r" rSSUlt in th3t the RO ' coeffic ^s are found significant at thTeariy 
more th°J { * e < rans ™*™ a " d therefore to be coded first, as compared to the BG The 
S i S C ° efflc,ents are shifted - the earlier the ROI coefficients will be found 
significant. Therefore, it is possible to control the speed reconstruction of The ROI The 

foMhe'RO, ltl« kn °T th ^ R °' Sh3pe 3nd ,OCati ° n 35 we " as the shif°ng factor uled 
Ss ream an the Z^L ^ "^T^u is stored at the header information of the 

nw^jRoTr?^ d 11 Wh6n r6CeiVing the bitstrea ™- Notice that the 

number of ROIs can be arb.trary and the shifting (multiplying) factors of the ROI 

Ksiysr for different ro,s - in this case - u is p ° ssibie » 

i«mf e ,l hat 1 th . S coefficients are lifted so much that the smallest ROI coefficient is 
^^^RO?^ 6 C ° em r?> H th6n a " R0 » COeffici - ts will be coded tsl Th 
Ascribed below^ 2 be re f° nstructed completely first (even up to lossless as will be 
WhS „!l£ I J' f ° ' n th,S Case the R01 sha P 6 is not need ed to be transmitted 
t^at l^Vrl « e tra ? m ' tted is the shifting factor and a signal to the decode? teTng 
hat that the coeff.c.ents that have been received are ROI coefficients (alternative!! at 
the .beginning of the header, the number of bytes that correspond to the RO 
rZ« ! C,en s , can tu be add * d so that the decoder knows when to stop decodinq RO 
coefficents). In this manner, there is no need to transmit the shape of the ROL since 
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the decoder can find it. This might be important when the shape information is to 
occupy significant amount of bits (which is the case for arbitrary shaped regions). The 
saved bits can be used for better coding the ROI or the background. When the ROI 
bitrate is achieved, the decoder will know what will be received for the background. 

Notice that the above scheme is not only applicable in DCT-based coders. Wavelet 
based coders can also use a similar scheme in order to avoid transmitting the shape 
information of the ROI's. The scheme can be used for example in Nister D., 
Christopoulos C.A., " Progressive lossy to lossless coding with a Region of Interest 
using the Two-Ten integer wavelet ", ISO/IEC JTC1/SC29/WG1 N744 Geneva 
Switzerland, 23-27 March 1998. 

It is also to be noted that this way of transmitting the shape of the ROI can be 
employed for any transmission scheme transmitting bitplanes in an order of decreasing 
significance. The only requirement is that the ROI coefficients are shifted with a factor 
large enough for them to be transmitted before any coefficient of the background. The 
method will work both for transformed images, e.g. DCT or wavelet transformed, as well 
as for ordinary bit plane coded images. In order for the receiver to know when the ROI 
coefficients information on when the last ROI coefficient has been transmitted must be 
transmitted to the receiver. This can be carried out in a number of different ways. For 
example, a signal can be transmitted, a bit plane consisting of only zero-valued 
coefficients, or a header in the beginning of the transmission indicating the number of 
bits used for the ROI coefficients. 

Thus, only ROI is reconstructed at the early stages of the transmission. This makes it 
difficult for the receiver to know what the rest of the image looks like. This method is 
therefore good for speed reconstruction of the ROI. However, it avoids exploiting 
correlation between neighbouring blocks in BG and ROI at the early stages. 

According to a third scheme all of the image is coded until the background bitrate (or 
desired quality) is achieved and then only the ROI is continued to be coded. The 
complexity of such a scheme remains similar to the one of the system described herein. 
Thus, there is no dynamic range increase, correlation between neighbouring blocks is 
exploited (at least until the BG rate is achieved). Furthermore it is possible that after 
the switch rate (from BG rate to improving the ROI) the remaining bits of the DCT 
coefficients in ROI will be raw bits. This means that there is little to gain by exploiting 
the correlation between these bits and then they can be coded even without an entropy 
coder. 

Also, when using such a scheme a signal to the receiver needs to be sent when the BG 
rate is obtained, and that the ROI now is to be improved only. The decoder also needs 
to know the ROI shape. This method thus results in slower ROI reconstruction 
compared to the second scheme, but in all cases the receiver has an idea of the whole 
image. 

The receiver can change the ROI location during the transmitting stage. When the 
receives gets an image in a progressive transmission manner, it can specify a 
particular ROI that he wants to see fast and/or with better quality than the rest of the 
image. The location and shape of the ROI is send to the transmitter. If the transmitter 
has the image in compressed form, it does a variable length decoding (arithmetic 
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TlZ 6 tVll Ur S TT ] in ° rder t0 find °* ihe coefficients that correspond to the Rof 

th» a r^n t0 ? S n ? ed ,hat ,he al90rithm as ^Scribed herein is assumed to compress 
^th^cS^ imerlea " in9 ' - -»edd d e'd° b C = 

mrte°XeZ?e^!?E0 3 ^« "^1 °? ° CT ' minim3 ' ™ difi ~«°ns have ,o be 
t^T^^ 

Bz&$£^« SMS* S^^srss 

thTrff^ this case the opportunity of lossless coding is achieved and 

proposed sc°h™?7 lransmissi ° n fr °™ 'o»sy to lossless can bl achfeved Te 

property of me Sgomhm ' ' U " y proaress " e b * maintaining the embedded 
SSS^S? "ft?"" 2 ~ - - -co" d ed 

deirSSH ! - ^"xr- 

examrte in H 261 H 7 « h . U ^ f0r COd,n9 bo,h in,ra and intsr macroblocks (for 
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CLAIMS 



1 . A method of compressing a digitized image comprising the steps of: 

- division of the image into blocks 

- Discrete Cosine Transformation (DCT) of each block of the image, 

- representation of each DCT block as a binary bit-plane, 

characterized in that the DCT coefficients are scanned and transmitted in an order of 
decreasing bit-plane significance and, for each bit-plane, in a diagonal scan order 
starting from the upper left corner and ending in the lower right corner of each DCT 
block. 

2. A method according to claim 1, characterized in that signals, cut_off signals, are 
transmitted when there are no more coefficients are to be transmitted for a particular 
block. 

3. A method according to any of claims 1 or 2, characterized in that the output bits are 
coded by means of arithmetic coding. 

4. A method according to claim 3, characterized in that the raw bits are not coded by 
arithmetic coding. 

5. A method according to any of claims 1-4, characterized in that one coefficient is 
updated in all blocks before proceeding to the next coefficient. 

6. A method according to any of claims 1-5, when the image to be compressed 
comprises region of interest (ROI) coefficients, characterised in that the ROI 
coefficients are shifted or multiplied with by a certain number. 

7. A method according to claim 6, characterised in that ROI information is transmitted 
as header information in the beginning of the transmission. 

8. A method according to any of claims 6 or 7, characterised in that coefficients of 
different region of interest are shifted/multiplied by different factors. 

9. A method according to claim 6, characterised in that ROI coefficients are shifted so 
that they can be identified by a receiver without any additional information. 

10. A method according to any of claims 1-5, when the image to be compressed 
comprises region of interest (ROI) coefficients, characterised in that all of the image is 
coded until a bitrate determined for background information is reached and that then 
only region of interest coefficients are continued to be coded. 

11. A method according to claim 10, characterised in that a signal is transmitted to the 
receiver when the background bitrate is achieved, 

12. A method according to any of claims 10 or 11, characterised in that a receiver 
receiving the compressed image can change the ROI during transmission of the image. 

13. A method of transmitting an image comprising a region of interest (ROI), without 
transmitting the shape of the ROI, characterised in that the ROI coefficients are 
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shifted before the transmission and that the bits of the image are transmitted in an 
order of decreas.ng bitplanes so that the ROI coefficients are transmitted ffrTt 

14. A method according to claim 13, characterised in that a signal is transmitted to a 
receiver .nforming the receiver when the transmission of ROI co^dentJhas^rSed 

15. A device for compressing a digitized image comprising means for 

- dividing the image into blocks 

- Discrete Cosine Transforming (DCT) each block of the image 

- representing each DCT block as a binary bit-plane 

decre^rnn 2 hft ^H*™ ^ scannin 9 and transmitting the DCT coefficients an order of 
startTnc ?£Sm s, 9 n ' fica ™e and, for each bit-plane, in a diagonal scan order 

starting from the upper left corner and ending in the lower right corner of each DCT 

cut off^nntf C °h din ?K t0 C ' aim 15 ' characterised by means for transmitting signals 
cut.off signals, when there are no more coefficients to be transmitted for a particular 

th 7 ; ^i7^ 0r(i]n9 t0 any ° f Claims 15 or 16 ' characterised by means for coding 
the output bits by means of arithmetic coding. 9 

Innmo^H 6 acC ° rdin 9 to claim 17 - characterised by means for coding the raw bits by 
another coding algorithm than arithmetic coding. V 

IL^^ a > CCOV n2? *? any ° f Claims 15 " 18 ' characterised by means for updating 
one coeff.c.ent in all blocks before proceeding to the next coefficient. 

20. A device according to any of claims 15-19, when the image to be compressed 
•comprises reg.on of interest (ROI) coefficients, characterised in that Te rS 
coemcents are shifted or multiplied with by a certain number. 

fnL™ "I®* 06 acc ° rdin 9 to claim 20, characterised by means for transmitting ROI 
mformat.on as header information in the beginning of the transmission. 

22. A device according to any of claims 20 or 21, characterised by means for shiftina 
coefficients of different region of interest by different factors. 9 

so thaft^^ £ C l a l m 2 °' characterised in *at ROI coefficients are shifted 

so that they can be identified by a receiver without any additional information. 

24^A device according to any of claims 15-19, when the image to be compressed 
comprises region of interest (ROI) coefficients, characterised by means for codinq me 

IhTrfafterol 3 T** for back 9 round -formation is reached and" for 

thereafter only coding region of interest coefficients. 

25^ A device according to claim 24, characterised by means for transmitting a signal to 
the rece.ver when the background bitrate is achieved. 9 9 



SUBSTITUTE SHEET (RULE 26) 



WO 99/16250 




PCT/SE98/01070 



15 



26. A device according to any of claims 24 or 25, characterised by means for 
receiving information from a receiver receiving the compressed image changing the 
ROI during transmission of the image. 

27. A device for transmitting an image comprising a region of interest (ROI), without 
transmitting the shape of the ROI, characterised by means for shifting the ROI 
coefficients before the transmission and for transmitting the bits of the image are in an 
order of decreasing bitplanes so that the ROI coefficients are transmitted first. 

28. A device according to claim 27, characterised by means for transmitting a signal to 
a receiver informing the receiver when the transmission of ROI coefficients has ended. 
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